Identifying Structure across Pre-partitioned Data
نویسندگان
چکیده
We propose an information-theoretic clustering approach that incorporates a pre-known partition of the data, aiming to identify common clusters that cut across the given partition. In the standard clustering setting the formation of clusters is guided by a single source of feature information. The newly utilized pre-partition factor introduces an additional bias that counterbalances the impact of the features whenever they become correlated with this known partition. The resulting algorithmic framework was applied successfully to synthetic data, as well as to identifying text-based cross-religion correspondences.
منابع مشابه
Privacy Preserving Association Rule Mining in Vertically Partitioned Data
Data mining technology has emerged as a means for identifying patterns and trends from large quantities of data. This paper presents privacy preserving association rule mining across vertically partitioned data. We present an efficient algorithm to discover association rules with minimum levels of support and confidence, from heterogeneous data distributed across 2 parties, while preventing eit...
متن کاملL2 Teachers’ Representations of Classroom Management Events: Variations across Experience Levels
Knowledge representation, defined as the way individuals structure their knowledge and cognitive processing of events and the associated sense-making processes, is believed to influence teachers’ reasoning/thinking skills. While extensively researched in mainstream teacher education, this line of inquiry is essentially lacking in the L2 teacher education literature. To fill some of the void, th...
متن کاملThe determinants of capital structure across firms’ sizes: The U.K evidence
This paper explores the leverage determinants across firms’ sizesbased on the two main theories behind the capital structure, the trade-offand the pecking order theories. A panel data is sued to find therelationship between capital structure and the variables that proxy forbenefits and costs of debt during 1990 to 2006. Our findings show thatboth principles help to explain the capital structure...
متن کاملEfficient Edge Noise Removal and Perceptual Feature Classification
Over-segmentation of edge features has been a challenging problem for many edge-based vision applications. Too many useless features are simply background noise which are costly for higher-level processing. The conventional methods of dealing with oversegmentation use various noise suppressing filters at pixel level for the entire image, and then form features by grouping identified edge points...
متن کاملتخمین وفقی مرز کلاتر در کلاترهای ویبول با استفاده از پیش آشکارساز UMPI
In radar detection, the existence of the clutter edge in the reference samples considerably degrades the performance of the detector. Hence, clutter edge estimation not only improves the CFAR detectors, but also can be used for partitioning the various areas of the clutter in the clutter map. In this paper, we propose an adaptive algorithm for detecting the clutter edge between two Weibull clut...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003